Picture for Qiuyang Mang

Qiuyang Mang

OpenDeepThink: Parallel Reasoning via Bradley--Terry Aggregation

Add code
May 14, 2026
Viaarxiv icon

FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale

Add code
May 14, 2026
Viaarxiv icon

Do Androids Dream of Breaking the Game? Systematically Auditing AI Agent Benchmarks with BenchJack

Add code
May 12, 2026
Viaarxiv icon

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

Add code
Apr 05, 2026
Viaarxiv icon

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Add code
Mar 09, 2026
Viaarxiv icon

AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization

Add code
Feb 23, 2026
Viaarxiv icon

FrontierCS: Evolving Challenges for Evolving Intelligence

Add code
Dec 17, 2025
Figure 1 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 2 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 3 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 4 for FrontierCS: Evolving Challenges for Evolving Intelligence
Viaarxiv icon

Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards

Add code
Oct 09, 2025
Viaarxiv icon

Retromorphic Testing: A New Approach to the Test Oracle Problem

Add code
Oct 10, 2023
Viaarxiv icon

Automated Testing and Improvement of Named Entity Recognition Systems

Add code
Aug 14, 2023
Figure 1 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 2 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 3 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 4 for Automated Testing and Improvement of Named Entity Recognition Systems
Viaarxiv icon